DESCRIPTION
DIDEOXY DYE TERMINATORS

FIELD OF THE INVENTION 
This invention relates to dye terminator nucleic 
acid sequencing and reagents for such sequencing. 

BACKGROUND OF THE INVENTION 
The following is a discussion of the relevant art,
none of which is admitted to be prior art to the
appended claims.

Sequence reaction products must be labeled. This
can be done using labeled primers, labeled nucleotides
(usually radioactive dNTPs) or labeled ddNTP
terminators. The use of labeled terminators has the
advantage of leaving false-stops undetectable.

DNA sequence bands do not necessarily have uniform
intensities. It is useful to express band intensity
variability numerically. This can be done by reporting
the ratio of maximum to minimum intensity of nearby
bands (within a window of perhaps 40 bases) in a DNA
sequence or, with normalization and correction for
systematic "drift" in intensity, by reporting the root
mean square of band intensities (typically peak
heights) (Fuller, C.W., Comments 16(3):1-8, 1989). It is
advantageous to have uniformity of band intensity, as
sequence accuracy and read-length are improved with bands
of more uniform intensity.
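
As an illustration only, the first measure described above (the ratio of maximum to minimum intensity among nearby bands) might be computed as in the following Python sketch. The function name, the sliding-window scan and the default window of 40 bands are assumptions made for this example; they are not taken from Fuller (1989).

```python
def max_min_ratio(peak_heights, window=40):
    """Largest max/min intensity ratio found in any run of `window` adjacent bands.

    `peak_heights` is a list of positive band intensities in gel order.
    If fewer than `window` bands are supplied, all of them form one window.
    """
    if not peak_heights:
        raise ValueError("need at least one peak height")
    if any(h <= 0 for h in peak_heights):
        raise ValueError("peak heights must be positive")
    worst = 1.0
    last_start = max(len(peak_heights) - window, 0)
    for start in range(last_start + 1):
        chunk = peak_heights[start:start + window]
        worst = max(worst, max(chunk) / min(chunk))
    return worst
```

A perfectly uniform lane gives a ratio of 1.0; larger values indicate less uniform bands within at least one window.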
For accurate reading, any given sequencing reaction
product must migrate through the electrophoresis gel
with a speed that depends only on its length. Products
which migrate faster or slower than normal for a given
length will result in sequence ambiguities or errors
known as "compressions".

Anomalous migration speed can be caused by
secondary structure of the DNA and is apparently the
cause of most "compression" artifacts seen in
radioactive-label (and other) sequencing experiments at
GC-rich regions. These can often be resolved by the use
of analogs of dGTP such as 7-deaza-dGTP or dITP.
Another compression-like artifact is observed when some
dye-labeled ddNTPs are used for sequencing. Several
examples of this can be seen in Lee, L.G., Connell,
C.R., Woo, S.L., Cheng, R.D., McArdle, B.F., Fuller,
C.W., Halloran, N.C., and Wilson, R.K., Nucleic Acids
Res., 20:2471-2483, 1992 (see figures 4g, 4h and 6h
using ddCTP labeled with tetramethylbodipy and TMR or
ddGTP labeled with bifluor). These compression-like
artifacts are produced even in sequences which are
compression-free when sequenced radioactively or with
dye-labeled primers. These artifacts can sometimes be
eliminated by substituting dITP for dGTP or alpha-thio
dNTPs for normal dNTPs (Lee, L.G. et al., Nucleic Acids
Res., 20:2471-2483, 1992; U.S. Patent No. 5,187,085).
Similar artifacts seen with the fluorescein dye-labeled
ddNTPs sold by Applied Biosystems for dye-terminator
sequencing with T7 DNA polymerase are resolved by
substituting alpha-thio dNTPs for normal dNTPs (Lee,
L.G. et al., Nucleic Acids Res., 20:2471-2483, 1992;
U.S. Patent No. 5,187,085).

Prober, J.M., Trainor, G.L., Dam, R.J., Hobbs,
F.W., Robertson, C.W., Zagursky, R.J., Cocuzza, A.J.,
Jensen, M.A. and Baumeister, K., Science 238:336-41
(1987) performed sequencing using terminators labeled
with substituted succinyl-fluoresceins with linkers of
10 atoms in length, together with dATP, dCTP, dTTP,
7-deaza-dGTP and AMV reverse transcriptase, and a
fluorescence-detecting instrument. From Fig. 6 of this
paper it is clear that overall band intensities varied by
more than 10-fold, far more than the best available
current methods with dye primers or radioactive labels.
Dideoxy NTP terminators that have the same basic
structure as the Prober et al. (1987) terminators, but
have four rhodamine dyes used in place of the succinyl
fluoresceins and linkers of 5 atoms in length, have been
used for sequencing with Taq polymerase. In order to
use these terminators, dITP is used in place of dGTP or
7-deaza-dGTP to eliminate severe "compression"
artifacts. This method has been practiced using cloned
Taq DNA polymerase (Bergot, WO 9105060; Parker, L.T.,
Deng, Q., Zakeri, H., Carlson, C., Nickerson, D.A., Kwok,
P.Y., Biotechniques 19(1):116-121, 1995) and with a
mutant of Taq polymerase (D49G, AmpliTaq CS) lacking
5'-3' exonuclease activity. However, band intensities vary
by as much as 20-fold, limiting the accuracy and
read-length possible with the method (Parker, L.T., Zakeri,
H., Deng, Q., Spurgeon, S., Kwok, P.Y., Nickerson, D.A.,
Biotechniques 21(4):694-699, 1996).

Lee, L.G., Connell, C.R., Woo, S.L., Cheng, R.D.,
McArdle, B.F., Fuller, C.W., Halloran, N.C. and Wilson,
R.K. (Nucleic Acids Res., 20:2471, 1992) describe
sequencing with a set of ddNTP terminators and T7 DNA
polymerase. All have fluorescein-type dyes attached to
the ddNTPs in essentially the same manner as the
rhodamine terminators used for Taq sequencing. These
are used with modified T7 DNA polymerase (Sequenase™
version 2.0) and alpha-thio dNTPs. The thio dNTPs are
used to resolve the "compression" artifacts, just as dITP
is used for the Taq dye-terminator methods. With this
system, bands vary in intensity about 10-fold.

Wayne Barnes has published a protocol for
dye-terminator sequencing with FY modified polymerases and
Mn2+ (Scientech Corp., St. Louis, MO). Bands are more
uniform with this method, varying about 4.5-fold at most.

Fluorescein-12 ddNTPs that have a linker length of
12 atoms and Biotin-11 ddNTPs that have a linker length
of 11 atoms are available (Dupont NEN, Wilmington, DE).
These labeled ddNTPs are described as useful in
sequencing reactions.

ABI PRISM discloses dichlororhodamine dyes linked to
terminators by propargyl/ethylene oxide/amino ("EO")
linkers eight atoms in length for sequencing (Rosenblum,
B.B., Lee, L.G., Spurgeon, S.L., Khan, S.H., Menchen,
S.M., Heiner, C.R., and Chen, S.M., Nucleic Acids Res.
25(22):4500-4504, 1997).

Cyanine dyes have been utilized in dye terminators
for sequencing (Lee et al., Nucleic Acids Res.,
20(10):2471, 1992).



SUMMARY OF THE INVENTION

The present invention provides novel dideoxy
dye-labeled terminators which are useful in a number of
biological processes, including providing uniform band
intensities and the resolution of dye-induced
compression artifacts in DNA sequencing. The dideoxy
dye-labeled terminators of the present invention are
particularly well suited for use with DNA polymerases
that are thermostable or which contain an altered dNMP
binding site (Tabor et al., U.S. Patent No. 5,614,365).
Use of the terminators of the present invention for
sequencing does not require the use of nucleotide
analogs such as dITP or alpha-thio nucleotides to
eliminate dye-induced compression artifacts. Applicant
has surprisingly found that there is a strong
correlation between the length of the link between the
dye molecule and the nucleotide and band uniformity, but
little correlation between the type of dye (or other
parameters) and band uniformity. Dye terminators with
linkers of 10 or more atoms (extended linkers) up to 25
atoms (10, 11, 12 ... 25), when used in sequencing
reactions, produce bands in sequencing gels of
significantly improved uniformity compared with dye
terminators with linkers of less than 10 atoms.
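
The comparison can be illustrated with the band variability (rms) values reported in Table 1 of this document. The Python sketch below is not part of the original disclosure: the list holds only those rows of Table 1 whose linker length and rms value are both legible in this copy, and the helper name `mean_rms` is an assumption made for the example.

```python
# (linker length in atoms, band variability rms) pairs transcribed from the
# legible rows of Table 1; rows with an unreadable entry are left out.
TABLE_1 = [
    (5, 0.32), (5, 0.77), (5, 0.34), (5, 0.32), (5, 0.57), (5, 0.36),
    (5, 0.47), (6, 0.40),
    (10, 0.24), (10, 0.15), (10, 0.21), (10, 0.21), (10, 0.20),
    (12, 0.16), (12, 0.20), (12, 0.17), (12, 0.18), (12, 0.13), (12, 0.25),
    (12, 0.21), (12, 0.16), (12, 0.26), (12, 0.37),
    (16, 0.32), (16, 0.24), (16, 0.22), (16, 0.24),
    (17, 0.16), (17, 0.22), (17, 0.11), (17, 0.14), (17, 0.10), (17, 0.20),
    (17, 0.18), (17, 0.16), (17, 0.24), (17, 0.18), (17, 0.25),
    (24, 0.24),
]

def mean_rms(rows):
    """Average band variability over a list of (linker length, rms) rows."""
    return sum(rms for _, rms in rows) / len(rows)

# Split at the 10-atom threshold used in the text.
short_linkers = [row for row in TABLE_1 if row[0] < 10]
extended_linkers = [row for row in TABLE_1 if row[0] >= 10]

print(f"linkers < 10 atoms:  mean rms {mean_rms(short_linkers):.2f}")
print(f"linkers >= 10 atoms: mean rms {mean_rms(extended_linkers):.2f}")
```

With the values transcribed here, the short-linker group averages about 0.44 and the extended-linker group about 0.20.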

The dye terminators of the present invention with
extended linkers typically are provided in groups of
four (A, T, G, C) with or without a thermostable DNA
polymerase and are especially useful in a method of
sequence analysis.

In a first aspect, the invention features a kit for
DNA sequencing having a first, second, third and fourth
dye terminator molecule, each of the dye terminator
molecules having a dye molecule, a linker of at least 10
atoms in length and either ddATP, ddCTP, ddGTP or ddTTP
as a mono or tri-phosphate, and a thermostable DNA
polymerase.

By "dye molecule" is meant any molecule that has a
detectable emission spectrum, including but not limited
to fluorescein, rhodamine, Texas Red, eosin, lissamine,
coumarin, cyanine, and derivatives of these molecules.
Dyes also include energy transfer dyes each comprising a
donor and an acceptor dye.

By "linker" is meant a chain of at least 10 atoms
comprising carbon, nitrogen, and oxygen which links the
dye molecule with the dideoxynucleotide. The chain may
also contain substituted carbon or sulfur. Linkage
typically occurs at the aromatic base moiety of the
nucleotide. The first two atoms of the linker attached
to the base are typically joined in a triple bond.

By "substituted carbon" is meant that one or more
hydrogens are replaced with a substitute group such as,
but not limited to, hydroxyl, cyano, alkoxy, oxygen,
sulfur, nitroxy, halogen, -N(CH3)2, amino, and -SH.

By "thermostable DNA polymerase" is meant a DNA
polymerase that has a half-life of greater than 5 minutes
at 80°C. Such polymerases include, but are not limited to,
DNA polymerases encoded by Thermus aquaticus, Thermus
thermophilus, Thermus flavus, Thermococcus litoralis,
Pyrococcus furiosus, Thermotoga maritima, and Thermotoga
neapolitana.

In a preferred embodiment the thermostable DNA
polymerase has an altered dNMP binding site so as to
improve the incorporation of dideoxynucleotides relative
to the natural polymerase. A DNA polymerase with an
altered dNMP binding site does not discriminate
significantly between dideoxynucleotides and
deoxynucleotides. The chance of incorporating a
dideoxynucleotide is approximately the same as that of a
deoxynucleotide or at least 1/10 the efficiency of a
deoxynucleotide.
In a second aspect the invention features a
compound of formula (I)

[Formula (I): structure not reproduced]

A is a cyanine dye of the structure

[Cyanine dye structure bearing substituents R1 to R7: not reproduced]

wherein the curved lines represent carbon atoms
necessary for the formulation of cyanine dyes; X and Y
are selected from the group consisting of O, S, and
CH3-C-CH3; m is an integer selected from the group
consisting of 1, 2, 3, and 4; R1, R2, R3, R4, R5, R6,
and R7 are independently selected from the group
consisting of H, OH, CO2H, sulfonic acid or sulfonate
groups, esters, amides, ethers, alkyl or aryl groups,
and B, and one of R1, R2, R3, R4, R5, R6 or R7 is B.

B is a linker of at least 10 atoms in length
wherein the atoms are selected from the group consisting
of carbon, nitrogen, oxygen, substituted carbon and
sulfur and the linker is attached at one end to A and at
the other end to C.

C is a dideoxynucleotide selected from the group
consisting of:

[Dideoxynucleotide structures: not reproduced]

and wherein the linker is covalently bonded to the
dideoxynucleotide at position 7 for the purines (ddG,
ddA) and at position 5 for the pyrimidines (ddT, ddC)
and wherein r is a mono or tri-phosphate.

The term "sulfonic acid or sulfonate groups" refers
to SO3H groups or salts thereof.

The term "ester" refers to a chemical moiety with
formula -(R)n-COOR', where R and R' are independently
selected from the group consisting of saturated or
unsaturated alkyl and five-membered or six-membered aryl
or heteroaryl moieties and where n is 0 or 1.

The term "amide" refers to a chemical substituent
of formula -NHCOR, where R is selected from the group
consisting of hydrogen, alkyl, hydroxyl, and
five-membered or six-membered aryl or heteroaryl ring
moieties, where the ring is optionally substituted with
one or more substituents independently selected from the
group consisting of alkyl, halogen, trihalomethyl,
carboxylate, nitro, or ester.

The term "ether" refers to a chemical moiety with
formula R-O-R' where R and R' are independently selected
from the group consisting of saturated or unsaturated
alkyl and five-membered or six-membered aryl or
heteroaryl moieties and where n is 0 or 1.

The term "alkyl" refers to a straight-chain or
branched aliphatic hydrocarbon. The alkyl group is
preferably 1 to 10 carbons, more preferably a lower
alkyl of from 1 to 7 carbons, and most preferably 1 to 4
carbons. Typical alkyl groups include methyl, ethyl,
propyl, isopropyl, butyl, isobutyl, tertiary butyl,
pentyl, hexyl and the like. The alkyl group may be
substituted and some typical alkyl substituents include
hydroxyl, cyano, alkoxy, oxygen, sulfur, nitroxy,
halogen, -N(CH3)2, amino, and -SH.

The term "aryl" refers to an aromatic group which
has at least one ring having a conjugated pi electron
system and includes both carbocyclic aryl (e.g. phenyl)
and heterocyclic aryl groups (e.g. pyridine). The term
"carbocyclic" refers to a compound which contains one or
more covalently closed ring structures in which the
atoms forming the backbone of the ring are all carbon
atoms. The term thus distinguishes carbocyclic from
heterocyclic rings in which the ring backbone contains
at least one atom which is different from carbon. The
term "heteroaryl" refers to an aryl group which contains
at least one heterocyclic ring.

In a preferred embodiment the linker is selected
from the group consisting of:

-C≡C-CH2-NH-CO-(CH2)5-NH-CO-,
-C≡C-CH2-NH-CO-(CH2)5-NH-SO2-,
-C≡C-CH2-NH-CO-(CH2)10-NH-CO-,
-C≡C-CH2-NH-CO-(CH2)5-,
-C≡C-CH2-NH-CO-(CH2)5-NH-CO-(CH2)5-, and
-C≡C-CH2-NH-CO-(CH2)5-NH-CO-(CH2)10-NH-CO-.

In preferred embodiments the dideoxy dye
terminators are: a compound of the formula (II):

[Formula (II): structure not reproduced]

; a compound of the formula (III):

[Formula (III): structure not reproduced]

; a compound of the formula (V):

[Formula (V): structure not reproduced]

The Cy-5.5 ddGTP and ddATP compounds have a linker
of 10 atoms in length. The Cy-5.5 ddCTP and ddTTP
compounds have a linker of 17 atoms in length.

In a third aspect the invention features a
deoxyribonucleic acid sequence containing the compound
of formula I, II, III, IV or V.

In a preferred embodiment the invention features a
kit for DNA sequencing comprising compounds of formula
II, III, IV, and V.

In a further preferred embodiment the kit further
has a thermostable DNA polymerase; the thermostable DNA
polymerase has an altered dNMP binding site so as to
improve the incorporation of dideoxynucleotides relative
to the natural polymerase.

Applicant has surprisingly found that the one
parameter that most strongly correlates with band
uniformity is the length of the linker between the dye
and the ddNTP. Applicant has found that by extending
the linker length between the dye and the nucleotide for
any dye:ddNTP combination to at least 10 atoms,
band uniformity is substantially improved and there are
no dye-induced compression artifacts.

Thus, in a fourth aspect, the invention features a
method for determining the nucleotide base sequence of a
DNA molecule consisting of the steps of incubating a DNA
molecule annealed with a primer molecule able to
hybridize to the DNA molecule in a vessel containing a
thermostable DNA polymerase and a dye terminator with a
linker of at least 10 atoms between the dye and the
nucleotide, and separating DNA products of the incubating
reaction according to size whereby at least a part of
the nucleotide base sequence of the DNA molecule can be
determined.

In preferred embodiments, the dye terminator is a
compound of formula I, II, III, IV or V; the
thermostable DNA polymerase has an altered dNMP binding
site.

Other features and advantages of the invention will
be apparent from the following description of the
preferred embodiments thereof, and from the claims.

All articles, publications and patents cited in
this application are hereby incorporated by reference
in their entirety.

BRIEF DESCRIPTION OF THE FIGURES
Fig. 1 presents DNA sequence data generated using
M13mp18 containing a 115 bp Sau3AI fragment from lambda
inserted at the BamHI site and Cy5.5 ddGTP, ddATP, ddTTP,
and ddCTP dye terminators.

Fig. 2 is a graph of band intensity variability
(rms) vs linker length (atoms).



DESCRIPTION OF THE PREFERRED EMBODIMENTS 

The following Examples are provided for further
illustrating various aspects and embodiments of the
present invention and are in no way intended to be
limiting of the scope.

Example 1: Synthesis of dideoxy dye terminators

Cy5.5 dideoxynucleoside triphosphates

Dye terminators labeled with Cy5.5 were prepared
from propargylaminodideoxynucleotides (Prober, J.M.,
Trainor, G.L., Dam, R.J., Hobbs, F.W., Robertson, C.W.,
Zagursky, R.J., Cocuzza, A.J., Jensen, M.A. and
Baumeister, K., Science 238:336-41 (1987); U.S. Patent
Nos. 5,242,796, 5,306,618, and 5,332,666) and "CyDye
Fluorolink Cy5.5 mono reactive dye" product PA25501
(Amersham Life Science) to produce compounds II, III,
IV, and V. In the case of ddG and ddA, the
propargylaminonucleotide was directly reacted with the
N-hydroxysuccinimidyl ester of the Cy5.5 dye. In the
case of ddC and ddT, a longer linker was constructed by
reacting the propargylaminonucleotide with the
N-hydroxysuccinimidyl ester of N-trifluoroacetyl-6-
aminocaproic acid followed by hydrolysis in aqueous
ammonia of the trifluoroacetyl group. The resulting
compound was then reacted with the N-hydroxysuccinimidyl
ester of the Cy5.5 dye to give the 17-atom linker
between the Cy5.5 dye and the pyrimidine base.
In addition to Cy5.5 dyes, those who practice the
art would know how to identify and utilize other dyes,
including other cyanine dyes, with the appropriate
optical properties. Also, the construction and
attachment of various linkers is well known in the art.
Suitable reagents for linker construction include one or
more compounds consisting of activated forms of
amino-protected alkyl or aryl amino acids such as compounds
of the formula R-NH-(CH2)n-CO2R' or
R-NH-(CH2)n-X-(CH2)m-CO2R', where R is an acid- or
base-labile protecting group, R' is a reactive ester or
anhydride group, X is aryl, O, S, or NH, and where n and
m are 0-12. Other linkers constructed by N- or O- or
S-alkylation are also suitable. The exact linker length,
of at least 10 atoms, for a specific dye and
dideoxynucleotide combination can be determined
empirically by monitoring band uniformity in DNA
sequencing as described (see Example 3).

Example 2: Dye terminator cycle sequencing

DNA cycle sequencing was carried out using Thermo
Sequenase™ DNA polymerase (Amersham, Cleveland, OH) and
Cy5.5 dideoxy dye terminators using the following cycle
sequencing protocol:

1. A master mix was prepared consisting of the
following:

    Template DNA                       5.0 µl
    10X Reaction buffer (see below)    3.5 µl
    Primer, 2 µM                       1.0 µl
    Polymerase (see below)             2 µl
    H2O                               15.5 µl
    Total volume                      27.0 µl

    10X Reaction Buffer:
    150 mM Tris-HCl, pH 9.5
    35 mM MgCl2

    Polymerase: Thermo Sequenase™ DNA polymerase,
    10 U/µl; 0.0017 U/µl Thermoplasma acidophilum inorganic
    pyrophosphatase; 20 mM Tris-HCl, pH 8.5, 1 mM DTT, 0.1 mM
    EDTA, 0.5% Tween-20, 0.5% Nonidet P-40 and 50% glycerol.

2. Four microcentrifuge tubes were labeled and 2 µl of
Cy5.5 labeled ddG, ddA, ddT or ddC solution was added to
each tube.

    25:1 ddG Mix: 300 µM each of dGTP, dATP, dTTP & dCTP,
    12 µM Cy5.5 ddGTP
    25:1 ddA Mix: 300 µM each of dGTP, dATP, dTTP & dCTP,
    12 µM Cy5.5 ddATP
    25:1 ddT Mix: 300 µM each of dGTP, dATP, dTTP & dCTP,
    12 µM Cy5.5 ddTTP
    25:1 ddC Mix: 300 µM each of dGTP, dATP, dTTP & dCTP,
    12 µM Cy5.5 ddCTP

3. Six µl of the master mix (from step 1) was
aliquoted to each of the 4 tubes from step 2 above.
Cycling was carried out as follows: 95°C (30 sec),
45-55°C (30 sec) and 72°C (60 sec) for 35 cycles, then
incubation at 72°C for 5-7 minutes.

4. One µl of 8 M ammonium acetate was added to each
tube. Then 27 µl (approximately 3 times the reaction
volume) of chilled 100% ethanol was added. The mixture
was mixed and placed on ice for 20 minutes to
precipitate the DNA.

5. The mixture was centrifuged in a microcentrifuge
(~12,000 rpm) for 20-30 minutes at either room
temperature or 4°C. The supernatant was removed and
then 200 µl of 70% ethanol was added to wash the DNA
pellet.



WO 99/40223 



PCT/US99/02104 




6. The mixture was again centrifuged for 5 minutes,
the supernatant removed and the pellet dried (in a
vacuum centrifuge) for 2-3 minutes.

7. Each pellet was resuspended in 6 µl of formamide
loading dye (Amersham, Cleveland, OH) and vortexed
vigorously (10-20 sec) to ensure that all DNA was
dissolved. The mixture was briefly centrifuged to
collect the sample at the bottom of the tube.

8. Samples were heated to 70°C for 2-3 minutes to
denature the DNA, then placed on ice.

9. Then 1.5-2 µl of the volume was loaded onto a lane
of the sequencing gel, and the gel run on the MICRO Gene
Blaster instrument (VGI).
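
The quantities in steps 1 to 3 can be cross-checked with a few lines of arithmetic. The Python sketch below is illustrative only; the variable and function names are assumptions made for this example. The 8 µl reaction volume is inferred from step 2 (2 µl of terminator mix) plus step 3 (6 µl of master mix), and ramp times between temperatures are ignored.

```python
# Step 1: master mix volumes in microliters, as listed in the protocol.
MASTER_MIX_UL = {
    "template DNA": 5.0,
    "10X reaction buffer": 3.5,
    "primer (2 uM)": 1.0,
    "polymerase": 2.0,
    "water": 15.5,
}
TERMINATOR_MIX_UL = 2.0        # step 2: volume of ddNTP mix per tube
MASTER_MIX_PER_TUBE_UL = 6.0   # step 3: volume of master mix per tube
DNTP_UM, DDNTP_UM = 300.0, 12.0
CYCLES = 35
CYCLE_HOLDS_SEC = (30, 30, 60)  # 95 C, 45-55 C, 72 C

def master_mix_total_ul():
    """Sum of the listed components; should equal the stated 27.0 ul."""
    return sum(MASTER_MIX_UL.values())

def reaction_volume_ul():
    """Volume in each tube after step 3 (inferred, not stated in the text)."""
    return TERMINATOR_MIX_UL + MASTER_MIX_PER_TUBE_UL

def final_concentration_um(stock_um):
    """Concentration of a terminator-mix component once diluted in the tube."""
    return stock_um * TERMINATOR_MIX_UL / reaction_volume_ul()

def buffer_fold_in_reaction():
    """Strength of the reaction buffer in the tube, as a multiple of 1X."""
    in_master_mix = 10.0 * MASTER_MIX_UL["10X reaction buffer"] / master_mix_total_ul()
    return in_master_mix * MASTER_MIX_PER_TUBE_UL / reaction_volume_ul()

def cycling_hold_minutes(final_extension_min):
    """Total programmed hold time, excluding ramping between temperatures."""
    return CYCLES * sum(CYCLE_HOLDS_SEC) / 60 + final_extension_min

print(master_mix_total_ul())             # total master mix volume
print(DNTP_UM / DDNTP_UM)                # dNTP:ddNTP ratio of each mix
print(final_concentration_um(DNTP_UM))   # dNTP concentration in the tube
print(round(buffer_fold_in_reaction(), 2))
print(cycling_hold_minutes(5), cycling_hold_minutes(7))
```

Under these assumptions the components sum to the stated 27.0 µl, the dNTP:ddNTP ratio is the stated 25:1, and the buffer ends up at close to 1X in the reaction.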

For this sequence, the template DNA was M13mp18
containing a 115 bp Sau3AI fragment from bacteriophage
lambda inserted at the BamHI site (product number US
70171, Amersham). The primer was the -40 Forward 23-mer
universal primer (5'-GTTTTCCCAGTCACGACGTTGTA-3') (SEQ.
ID. NO. 1). Results are shown in Figure 1.
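
As a quick consistency check on the primer given above, the following Python sketch confirms that the sequence is a 23-mer and derives the template-strand sequence it anneals to. The helper names are assumptions made for this example and are not part of the protocol.

```python
# SEQ ID NO: 1, the -40 Forward universal primer, written 5' to 3'.
PRIMER = "GTTTTCCCAGTCACGACGTTGTA"

COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}

def reverse_complement(seq):
    """Sequence of the complementary strand, written 5' to 3'."""
    return "".join(COMPLEMENT[base] for base in reversed(seq))

def gc_count(seq):
    """Number of G and C residues in the sequence."""
    return sum(1 for base in seq if base in "GC")

print(len(PRIMER))                 # primer length in bases
print(gc_count(PRIMER))            # G + C residues
print(reverse_complement(PRIMER))  # site on the template strand
```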

Example 3: Correlation of linker length and band
intensity variability

Sequencing reactions were carried out as described
in Example 2 with various dye molecules linked to
dideoxynucleotides with linkers of various lengths (see
Table 1). The labeled DNA products were then separated
on denaturing polyacrylamide gels and the labeled
products were detected by fluorescence. The intensity
of the bands is taken as the height of the peaks in a
graph of fluorescence (in arbitrary units) against time.
Typically, systematic variations in peak heights can be
seen in graphs of peak heights plotted sequentially.
These systematic variations in the peak heights can be
modeled by least-squares fitting to a second-order
polynomial function. Dividing the peak height for each
band by the value of the curve-fit polynomial function
yields a normalized band intensity for each peak.
Variation in these band intensities can be expressed as
the square root of the variance,
$\sqrt{(n\sum x^2 - (\sum x)^2)/n^2}$, of the
normalized peak heights, which can typically have values
between 0 and 1 with more variability represented by
higher numbers (Fuller, C.W., Comments 16(3):1-8, 1989).
This value is numerically equal to the root-mean-square
(RMS) value when 1.0 is subtracted from the normalized
peak heights. These values are reported in Table 1 and
graphed in Fig. 2. Variability of band intensities was
significantly reduced when linkers of 10 or more atoms
in length were used, resulting in sequence data that was
easier to interpret accurately.
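
The calculation described above can be sketched in Python as follows. This is a minimal illustration and not the software actually used to produce Table 1: the function names, the use of band index (centered on zero) as the fitting variable, and the hand-written least-squares solver are all assumptions made for this example.

```python
import math

def fit_quadratic(xs, ys):
    """Least-squares fit y ~ a + b*x + c*x**2; returns (a, b, c)."""
    s = [sum(x ** k for x in xs) for k in range(5)]             # sums of x^k
    t = [sum(y * x ** k for x, y in zip(xs, ys)) for k in range(3)]
    # Normal equations written as an augmented 3x4 matrix.
    m = [[s[0], s[1], s[2], t[0]],
         [s[1], s[2], s[3], t[1]],
         [s[2], s[3], s[4], t[2]]]
    for col in range(3):                                        # Gauss-Jordan elimination
        pivot = max(range(col, 3), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for row in range(3):
            if row != col:
                factor = m[row][col] / m[col][col]
                m[row] = [a - factor * b for a, b in zip(m[row], m[col])]
    return tuple(m[i][3] / m[i][i] for i in range(3))

def band_variability(peak_heights):
    """Return (rms variability, normalized peak heights) for one lane."""
    n = len(peak_heights)
    if n < 3:
        raise ValueError("need at least three peaks to fit a quadratic")
    xs = [i - (n - 1) / 2 for i in range(n)]                    # centered band index
    a, b, c = fit_quadratic(xs, peak_heights)
    # Divide each peak by the fitted trend to remove systematic drift.
    normalized = [h / (a + b * x + c * x * x) for x, h in zip(xs, peak_heights)]
    # Square root of the variance: sqrt((n*sum(x^2) - (sum x)^2) / n^2).
    variance = (n * sum(v * v for v in normalized) - sum(normalized) ** 2) / n ** 2
    return math.sqrt(max(variance, 0.0)), normalized
```

A lane whose peaks follow the fitted trend exactly gives a value near 0, while a lane alternating between 1.2 and 0.8 times the trend gives a value near 0.2.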



Table 1

No.  Base      Dye(a)    Linker length(b)  Band uniformity (rms)
 1   T         Coumarin  5                 0.32
 2   G         Lissamine 5(d)              0.77
 3   G         R110      5(c)              0.34
 4   A         R6G       5(c)              0.32
 5   G         R6G       5(c)              0.57
 6   C         ROX       5(c)              0.36
 7   T         TMR       5(c)              0.47
 8   A         TxR       [illegible]       0.61
 9   C         Eosin     6(e)              0.40
10   G         Cy3       10(i)             0.24
11   A         Cy5       10(i)             0.15
12   G         Cy5       10(i)             0.21
13   A         Cy5.5     10(i)             0.21
14   G         Cy5.5     10(i)             0.20
15   A         Fl        12                0.16
16   C         Fl        12                0.20
17   G         Fl        12                0.17
18   T         Fl        12                0.18
19   A         R6G       12                0.13
20   T         R6G       12                0.25
21   A         ROX       12                0.21
22   T         ROX       12                0.16
23   C         TMR       12                0.26
24   G         TMR       12(f)             [illegible]
25   T         TMR       12                0.37
26   A         TxR       16                0.32
27   C         TxR       16                0.24
28   G         TxR       16                0.22
29   T         TxR       16                0.24
30   A         Cy3-Cy5   17                0.1[?]
31   [C or G]  Cy3-Cy5   17                0.16
32   [C or G]  Cy3-Cy5   17                0.22
33   T         Cy3-Cy5   17                0.11
34   C         Cy5       17                0.14
35   T         Cy5       17                0.10
36   C         Cy5.5     17                0.20
37   T         Cy5.5     17                0.18
38   A         Fl        17                0.16
39   C         Fl        17(h)             0.24
40   G         Fl        17(h)             0.18
41   T         Fl        17(h)             0.25
42   T         Fl        24(k)             0.24

(a) Abbreviations for dyes: Fl, Carboxyfluorescein; R110, Rhodamine
110; R6G, Rhodamine 6G; ROX, Rhodamine X; TMR, tetramethylrhodamine;
TxR, Texas Red (Molecular Probes). The dyes Cy3, Cy3.5, Cy5 and
Cy5.5 were from Amersham Life Science, Cleveland, OH.

(b) Linker length is the number of atoms between the ring structure of
the nucleoside base (A, C, G or T) and the ring structure of the
dye.

Linker structures

(c) -C≡C-CH2-NH-CO-
(d) -C≡C-CH2-NH-SO2-
(e) -C≡C-CH2-NH-CS-NH-
(f) -C≡C-CH2-NH-CO-(CH2)5-NH-CO-
(g) -C≡C-CH2-NH-CO-(CH2)5-NH-SO2-
(h) -C≡C-CH2-NH-CO-(CH2)10-NH-CO-
(i) -C≡C-CH2-NH-CO-(CH2)5-
(j) -C≡C-CH2-NH-CO-(CH2)5-NH-CO-(CH2)5-
(k) -C≡C-CH2-NH-CO-(CH2)5-NH-CO-(CH2)10-NH-CO-
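
The linker lengths quoted in Table 1 follow from counting backbone atoms in the linker structures listed above. The Python sketch below is an illustration only: the group-list encoding of each linker, the "C#C" notation for the triple-bonded carbon pair, and all names are assumptions made for this example. Oxygen and sulfur atoms that branch off the chain (as in CO, CS and SO2) are not counted.

```python
# Backbone atoms contributed by each group along the chain.
BACKBONE_ATOMS = {"C#C": 2, "CH2": 1, "NH": 1, "CO": 1, "CS": 1, "SO2": 1}

def linker_length(groups):
    """Count backbone atoms; a (name, n) tuple means the group repeats n times."""
    total = 0
    for group in groups:
        name, repeat = group if isinstance(group, tuple) else (group, 1)
        total += BACKBONE_ATOMS[name] * repeat
    return total

PROPARGYLAMINE = ["C#C", "CH2", "NH"]  # common start of every listed linker

LINKERS = [
    PROPARGYLAMINE + ["CO"],
    PROPARGYLAMINE + ["SO2"],
    PROPARGYLAMINE + ["CS", "NH"],
    PROPARGYLAMINE + ["CO", ("CH2", 5), "NH", "CO"],
    PROPARGYLAMINE + ["CO", ("CH2", 5), "NH", "SO2"],
    PROPARGYLAMINE + ["CO", ("CH2", 10), "NH", "CO"],
    PROPARGYLAMINE + ["CO", ("CH2", 5)],
    PROPARGYLAMINE + ["CO", ("CH2", 5), "NH", "CO", ("CH2", 5)],
    PROPARGYLAMINE + ["CO", ("CH2", 5), "NH", "CO", ("CH2", 10), "NH", "CO"],
]

for groups in LINKERS:
    print(linker_length(groups), groups)
```

Counted this way, the nine structures give lengths of 5, 5, 6, 12, 12, 17, 10, 17 and 24 atoms, in the order listed.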
CLAIMS

1. A kit for DNA sequencing comprising:
a first, second, third and fourth dye terminator
molecule, each of the dye terminator molecules comprising
a dye molecule, a linker of at least 10 atoms in length
and either ddATP, ddCTP, ddGTP or ddTTP as a mono or
tri-phosphate and a thermostable DNA polymerase.

2. The kit of claim 1, wherein said polymerase is
a thermostable DNA polymerase that has an altered dNMP
binding site so as to improve the incorporation of
dideoxynucleotides relative to the natural polymerase.

3. A compound of formula (I):

[Formula (I): structure not reproduced]

wherein A is a cyanine dye of the structure

[Cyanine dye structure: not reproduced]

and the curved lines represent carbon atoms necessary for
the formulation of cyanine dyes, X and Y are selected from
the group consisting of O, S, and
CH3-C-CH3, m is an integer selected from the group
consisting of 1, 2, 3, and 4, R1, R2, R3, R4, R5, R6 and
R7 are independently selected from the group consisting of
H, OH, CO2H, sulfonic acid or sulfonate groups, esters,
amides, ethers, alkyl or aryl groups and B, and one of R1,
R2, R3, R4, R5, R6 or R7 is B;

B is a linker of at least 10 atoms in length wherein
the atoms are selected from the group consisting of
carbon, nitrogen, oxygen, substituted carbon, and sulfur
and the linker is attached at one end to A and at the
other end to C; and

C is a dideoxynucleotide selected from the group
consisting of

[Dideoxynucleotide structures: not reproduced]

wherein said linker is covalently bonded to said
dideoxynucleotide at position 7 for ddA and ddG and at
position 5 for ddC and ddT and wherein r is a mono or
tri-phosphate.

4. The compound of claim 3, wherein said linker is
selected from the group consisting of

-C≡C-CH2-NH-CO-(CH2)5-NH-CO-,
-C≡C-CH2-NH-CO-(CH2)5-NH-SO2-,
-C≡C-CH2-NH-CO-(CH2)10-NH-CO-,
-C≡C-CH2-NH-CO-(CH2)5-,
-C≡C-CH2-NH-CO-(CH2)5-NH-CO-(CH2)5-, and
-C≡C-CH2-NH-CO-(CH2)5-NH-CO-(CH2)10-NH-CO-.

5. A compound of the formula (II):

[Formula (II): structure not reproduced]

6. A compound of the formula (III):

[Formula (III): structure not reproduced]

9. A deoxyribonucleic acid sequence containing the
compound of formula I.

10. A deoxyribonucleic acid sequence containing the
compound of formula II, III, IV, or V.

11. A kit for DNA sequencing comprising compounds of
formula II, III, IV, and V.

12. The kit of claim 11, further comprising a
thermostable DNA polymerase.

13. The kit of claim 12, wherein said polymerase is
a thermostable DNA polymerase that has an altered dNMP
binding site so as to improve the incorporation of
dideoxynucleotides relative to the natural polymerase.

14. Method for determining the nucleotide base
sequence of a DNA molecule comprising the steps of:
incubating a DNA molecule annealed with a primer
molecule able to hybridize to said DNA molecule in a
vessel containing a thermostable DNA polymerase, one
of a set of four dye terminators with a linker of at
least 10 atoms between the dye and the nucleotide and
separating DNA products of the incubating
reaction according to size whereby at least a part of
the nucleotide base sequence of said DNA molecule can
be determined.

15. Method for determining the nucleotide base
sequence of a DNA molecule comprising the steps of:
incubating a DNA molecule annealed with a primer
molecule able to hybridize to said DNA molecule in a
vessel containing a thermostable DNA polymerase, a
compound of formula I and
separating DNA products of the incubating
reaction according to size whereby at least a part of
the nucleotide base sequence of said DNA molecule can
be determined.

16. Method for determining the nucleotide base
sequence of a DNA molecule comprising the steps of:
incubating a DNA molecule annealed with a primer
molecule able to hybridize to said DNA molecule in a
vessel containing a thermostable DNA polymerase, a
compound of formula II, III, IV, or V and
separating DNA products of the incubating
reaction according to size whereby at least a part of
the nucleotide base sequence of said DNA molecule can
be determined.

17. The method of any of claims 14, 15, or 16
wherein said polymerase is a thermostable DNA polymerase
that has an altered dNMP binding site so as to improve the
incorporation of dideoxynucleotides relative to the
natural polymerase.
FIGURE 2 [graph of band intensity variability (rms) vs linker length (atoms); image not reproduced]

SEQUENCE LISTING

(1) GENERAL INFORMATION:

    (i) APPLICANT: Kumar, Shiv
                   Nampalli, Satyam
                   McArdle, Bernard F.
                   Fuller, Carl W.

    (ii) TITLE OF INVENTION: DIDEOXY DYE TERMINATORS

    (iii) NUMBER OF SEQUENCES:

    (iv) CORRESPONDENCE ADDRESS:
        (A) ADDRESSEE: Lyon & Lyon
        (B) STREET: 633 West Fifth Street, Suite 4700
        (C) CITY: Los Angeles
        (D) STATE: California
        (E) COUNTRY: U.S.A.
        (F) ZIP: 90071-2066

    (v) COMPUTER READABLE FORM:
        (A) MEDIUM TYPE: 3.5" Diskette, 1.44 Mb
        (B) COMPUTER: IBM Compatible
        (C) OPERATING SYSTEM: IBM P.C. DOS 5.0
        (D) SOFTWARE: FastSEQ for Windows 2.0

    (vi) CURRENT APPLICATION DATA:
        (A) APPLICATION NUMBER: To Be Assigned
        (B) FILING DATE: Herewith
        (C) CLASSIFICATION:

    (vii) PRIOR APPLICATION DATA:
        (A) APPLICATION NUMBER:
        (B) FILING DATE:

    (viii) ATTORNEY/AGENT INFORMATION:
        (A) NAME: Warburg, Richard J
        (B) REGISTRATION NUMBER: 32,327
        (C) REFERENCE/DOCKET NUMBER: 225/219

    (ix) TELECOMMUNICATION INFORMATION:
        (A) TELEPHONE: (213) 489-1600
        (B) TELEFAX: (213) 955-0440
        (C) TELEX: 67-3510

(2) INFORMATION FOR SEQ ID NO: 1:

    (i) SEQUENCE CHARACTERISTICS:
        (A) LENGTH: 23 base pairs
        (B) TYPE: nucleic acid
        (C) STRANDEDNESS: single
        (D) TOPOLOGY: linear

    (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:

        GTTTTCCCAG TCACGACGTT GTA                    23

Other embodiments are within the following claims. 